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<110> APPLICANT: Max-Planck-Gesellschaf t e.V. 

<120> TITLE OF INVENTION: Protein expression and structure solution using 

specific fusion vectors 
<130> FILE REFERENCE: ST010209-EPA 
<140> CURRENT APPLICATION NUMBER: US/10/044 , 303 
<141> CURRENT FILING DATE: 2002-01-11 
<160> NUMBER OF SEQ ID NOS : 3 
<170> SOFTWARE: Patentln Ver. 2.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 765 

<212> TYPE: PRT ' 
<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Partial myosin 
sequence of Dictyostelium; Component (1) of the 
recombinant protein M761-2R R238E 
<400> SEQUENCE: 1 

Met Asp Gly Thr Glu Asp Pro He His Asp Arg Thr Ser Asp Tyr His 

1 5 10 15 

Lys Tyr Leu Lys Val Lys Gin Gly Asp Ser Asp Leu Phe Lys Leu Thr 

20 25 30 

Val Ser Asp Lys Arg Tyr He Trp Tyr Asn Pro Asp Pro Lys Glu Arg 

35 40 45 

Asp Ser Tyr Glu Cys Gly Glu He Val Ser Glu Thr Ser Asp Ser Phe 

50 55 60 

Thr Phe Lys Thr Val Asp Gly Gin Asp Arg Gin Val Lys Lys Asp Asp 
65 70 75 ^ ~ 80 

Ala Asn Gin Arg Asn Pro He Lys Phe Asp Gly Val Glu Asp Met Ser 

85 90 95 

Glu Leu Ser Tyr Leu Asn Glu Pro Ala Val Phe His Asn Leu Arg Val 

100 105 HO 

Arg Tyr Asn Gin Asp Leu He Tyr Thr Tyr Ser Gly Leu Phe Leu Val 

115 120 125 

Ala Val Asn Pro Phe Lys Arg He Pro lie Tyr Thr Gin Glu Met Val 

130 135 140 

Asp He Phe Lys Gly Arg Arg Arg Asn Glu Val Ala Pro His He Phe 
145 150 155 160 

Ala He Ser Asp Val Ala Tyr Arg Ser Met Leu Asp Asp Arg Gin Asn 

165 170 175 

Gin Ser Leu Leu He Thr Gly Glu Ser Gly Ala Gly Lys Thr Glu Asn 

180 185 1 190 

Thr Lys Lys Val He Gin Tyr Leu Ala Ser Val Ala Gly Arg Asn Gin 
195 200 205 
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67 Ala Asn Gly Ser Gly Val Leu Glu Gin Gin He Leu Gin Ala Asn Pro 

68 210 215 220 

70 He Leu Glu Ala Phe Gly Asn Ala Lys Thr Thr Arg Asn Asn Asn Ser 

71 225 230 235 240 

73 Ser Arg Phe Gly Lys Phe He Glu He Gin Phe Asn Ser Ala Gly Phe 

74 245 250 255 

76 He Ser Gly Ala Ser He Gin Ser Tyr Leu Leu Glu Lys Ser Arg Val 

77 260 265 270 

79 Val Phe Gin Ser Glu Thr Glu Arg Asn Tyr His He Phe Tyr Gin Leu 

80 275 280 285 

82 Leu Ala Gly Ala Thr Ala Glu Glu Lys Lys Ala Leu His Leu Ala Gly 

83 290 295 300 

85 Pro Glu Ser Phe Asn Tyr Leu Asn Gin Ser Gly Cys Val Asp He Lys 

86 305 310 315 ~ 320 

88 Gly Val Ser Asp Ser Glu Glu Phe Lys He Thr Arg Gin Ala Met Asp 

89 325 330 335 

91 He Val Gly Phe Ser Gin Glu Glu Gin Met Ser He Phe Lys He He 

92 340 345 350 

94 Ala Gly He Leu His Leu Gly Asn He Lys Phe Glu Lys Gly Ala Gly 

95 355 360 365 

97 Glu Gly Ala Val Leu Lys Asp Lys Thr Ala Leu Asn Ala Ala Ser Thr 

98 370 375 380 

100 Val Phe Gly Val Asn Pro Ser Val Leu Glu Lys Ala Leu Met Glu Pro 

101 385 390 395 400 

103 Arg He Leu Ala Gly Arg Asp Leu Val Ala Gin His Leu Asn Val Glu 

104 405 410 415 

106 Lys Ser Ser Ser Ser Arg Asp Ala Leu Val Lys Ala Leu Tyr Gly Arq 
1° 7 420 425 430 

109 Leu Phe Leu Trp Leu Val Lys Lys He Asn Asn Val Leu Cys Gin Glu 

110 435 440 445 

112 Arg Lys Ala Tyr Phe He Gly Val Leu Asp He Ser Gly Phe Glu He 

113 450 455 460 

115 Phe Lys Val Asn Ser Phe Glu Gin Leu Cys He Asn Tyr Thr Asn Glu 

116 465 470 475 ' 480 
118 Lys Leu Gin Gin Phe Phe Asn His His Met Phe Lys Leu Glu Gin Glu 
H9 485 490 495 

121 Glu Tyr Leu Lys Glu Lys He Asn Trp Thr Phe He Asp Phe Gly Leu 

122 500 505 510 

124 Asp Ser Gin Ala Thr He Asp Leu He Asp Gly Arg Gin Pro Pro Gly 

125 515 520 525 

127 He Leu Ala Leu Leu Asp Glu Gin Ser Val Phe Pro Asn Ala Thr Asp 

128 530 535 540 

130 Asn Thr Leu He Thr Lys Leu His Ser His Phe Ser Lys Lys Asn Ala 

131 545 550 555 ' ' 560 

133 Lys Tyr Glu Glu Pro Arg Phe Ser Lys Thr Glu Phe Gly Val Thr His 

134 565 570 575 

136 Tyr Ala Gly Gin Val Met Tyr Glu He Gin Asp Trp Leu Glu Lys Asn 

137 580 585 590 

139 Lys Asp Pro Leu Gin Gin Asp Leu Glu Leu Cys Phe Lys Asp Ser Ser 
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173 <210> SEQ ID NO: 2 

174 <211> LENGTH: 1016 

175 <212> TYPE: PRT 

176 <213> ORGANISM: Artificial Sequence 

178 <220> FEATURE: 

179 <223> OTHER INFORMATION: Description of Artificial Sequence: Whole sequence 

180 of recombinant protein M761-2R R238 E 

182 <220> FEATURE: 

183 <223> OTHER INFORMATION: The protein comprises as component (1) the aa 

184 sequence of myosin II motor domain of 

185 Dictyostelium, a three aa linker region and the 

186 a-actinin aa sequence 

188 <400> SEQUENCE: 2 

189 Met Asp Gly Thr Glu Asp Pro He His 

190 1 5 

192 Lys Tyr Leu Lys Val Lys Gin Gly Asp 

193 20 ~ ^ 25 

195 Val Ser Asp Lys Arg Tyr He Trp Tyr 

196 35 40 

198 Asp Ser Tyr Glu Cys Gly Glu He Val 

199 50 55 

201 Thr Phe Lys Thr Val Asp Gly Gin Asp 

202 65 70 

204 Ala Asn Gin Arg Asn Pro He Lys Phe 

205 85 

207 Glu Leu Ser Tyr Leu Asn Glu Pro Ala 

208 100 105 
210 Arg Tyr Asn Gin Asp Leu He Tyr Thr 



Asp Arg Thr 
10 

Ser Asp Leu 

Asn Pro Asp 

Ser Glu Thr 
60 

Arg Gin Val 
75 

Asp Gly Val 
90 

Val Phe His 
Tyr Ser Gly 



Ser Asp 

Phe Lys 
30 

Pro Lys 

45 
Ser Asp 

Lys Lys 

Glu Asp 

Asn Leu 
110 
Leu Phe 



Tyr His 

15 
Leu Thr 

Glu Arg 

Ser Phe 

Asp Asp 
80 

Met Ser 
95 

Arg Val 
Leu Val 
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285 Asp Ser Gin Ala Thr lie Asp Leu lie Asp Gly Arg Gin Pro Pro Gly 

286 515 520 525 

288 lie Leu Ala Leu Leu Asp Glu Gin Ser Val Phe Pro Asn Ala Thr Asp 

289 530 535 540 

291 Asn Thr Leu lie Thr Lys Leu His Ser His Phe Ser Lys Lys Asn Ala 

292 545 550 555 " 560 

294 Lys Tyr Glu Glu Pro Arg Phe Ser Lys Thr Glu Phe Gly Val Thr His 

295 565 570 575 

297 Tyr Ala Gly Gin Val M$t Tyr Glu He Gin Asp Trp Leu Glu Lys Asn 

298 580 585 590 

300 Lys Asp Pro Leu Gin Gin Asp Leu Glu Leu Cys Phe Lys Asp Ser Ser 

301 595 600 605 

303 Asp Asn Val Val Thr Lys Leu Phe Asn Asp Pro Asn He Ala Ser Arg 

304 610 615 620 

306 Ala Lys Lys Gly Ala Asn Phe He Thr Val Ala Ala Gin Tyr Lys Glu 

307 625 630 635 ~ 640 

309 Gin Leu Ala Ser Leu Met Ala Thr Leu Glu Thr Thr Asn Pro His Phe 

310 645 650 655 

312 Val Arg Cys He He Pro Asn Asn Lys Gin Leu Pro Ala Lys Leu Glu 

313 660 665 670 

315 Asp Lys Val Val Leu Asp Gin Leu Arg Cys Asn Gly Val Leu Glu Gly 

316 675 680 ~ 685 

318 He Arg He Thr Arg Lys Gly Phe Pro Asn Arg He He Tyr Ala Asp 

319 690 695 700 

321 Phe Val Lys Arg Tyr Tyr Leu Leu Ala Pro Asn Val Pro Arg Asp Ala 

322 705 710 715 720 

324 Glu Asp Ser Gin Lys Ala Thr Asp Ala Val Leu Lys His Leu Asn He 

325 725 730 735 

327 Asp Pro Glu Gin Tyr Arg Phe Gly He Thr Lys He Phe Phe Arg Ala 

328 740 745 750 

330 Gly Gin Leu Ala Arg He Glu Glu Ala Arg Glu Gin Arg Leu Gly Ser 

331 755 760 765 

333 Glu Gin Thr Lys Ser Asp Tyr Leu Lys Arg Ala Asn Glu Leu Val Gin 

334 770 775 780 

336 Trp He Asn Asp Lys Gin Ala Ser Leu Glu Ser Arg Asp Phe Gly Asp 

337 785 790 795 800 

339 Ser He Glu Ser Val Gin Ser Phe Met Asn Ala His Lys Glu Tyr Lys 

340 805 810 ^ 815 

342 Lys Thr Glu Lys Pro Pro Lys Gly Gin Glu Val Ser Glu Leu Glu Ala 

343 820 825 830 

345 He Tyr Asn Ser Leu Gin Thr Lys Leu Arg Leu He Lys Arg Glu Pro 

346 835 840 845 

348 Phe Val Ala Pro Ala Gly Leu Thr Pro Asn Glu lie Asp Ser Thr Trp 

349 850 855 860 

351 Ser Ala Leu Glu Lys Ala Glu Gin Glu His Ala Glu Ala Leu Arg He 

352 865 870 875 880 

354 Glu Leu Lys Arg Gin Lys Lys He Ala Val Leu Leu Gin Lys Tyr Asn 

355 885 890 * 895 

357 Arg lie Leu Lys Lys Leu Glu Asn Trp Ala Thr Thr Lys Ser Val Tyr 




Use of n and / or Xaa has been detected in the 
Sequence Listing. Review the Sequence Listing 
to ensure a corresponding explanation is present 
in the <220> to <223> fields of each sequence 
using n or Xaa. 
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L:10 M:270 C: Current Application Number differs, Replaced Current Application Number 
L:ll M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:424 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID# : 3 
L:424 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID# : 3 
L:424 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 
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